2025-04-21 09:06:05.AIbase.17.3k
OpenAI's o3 Model Test Scores Questioned; Actual Performance Falls Far Short of Claims
OpenAI's recently released o3 AI model has sparked controversy over its benchmark test performance. While OpenAI confidently claimed in December that the model could correctly answer over a quarter of the highly challenging FrontierMath math problems, this assertion starkly contrasts with recent independent test results. The Epoch Institute's independent testing revealed the model achieved only a 10% success rate, significantly lower than advertised.